25. Data Preparation
Data Preprocessing
QUIZ QUESTION:
In the lessons and projects throughout this program, you've encountered various aspects of data preparation: balanced classes, randomization, train/test splits, and others. This quiz reviews why these steps are necessary. Match each data preparation step to the purpose it serves.
ANSWER CHOICES:
| Data Preparation Step | Purpose |
|---|---|
| | To avoid having your algorithm simply classify everything as belonging to the majority class. |
| | To avoid problems due to the ordering of the data. |
| | To avoid individual features or sets of features dominating the response of your classifier. |
| | To estimate how well the model generalizes to new data. |
SOLUTION:
| Data Preparation Step | Purpose |
|---|---|
| Balanced classes | To avoid having your algorithm simply classify everything as belonging to the majority class. |
| Randomization | To avoid problems due to the ordering of the data. |
| Normalization (feature scaling) | To avoid individual features or sets of features dominating the response of your classifier. |
| Train/test split | To estimate how well the model generalizes to new data. |
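
The four purposes in the solution can be sketched in code. The example below is a minimal illustration using only the Python standard library, not the course's own implementation: the helper names (`balance_classes`, `shuffle_together`, `train_test_split`, `fit_scaler`, `apply_scaler`) and the toy data are assumptions made up for this sketch. It undersamples the majority class, shuffles the samples, holds out a test set, and standardizes each feature using statistics computed on the training set only.

```python
import random
import statistics


def balance_classes(samples, labels, rng):
    """Undersample every class down to the size of the smallest class."""
    by_class = {}
    for x, y in zip(samples, labels):
        by_class.setdefault(y, []).append(x)
    n = min(len(xs) for xs in by_class.values())
    out_x, out_y = [], []
    for y, xs in by_class.items():
        for x in rng.sample(xs, n):
            out_x.append(x)
            out_y.append(y)
    return out_x, out_y


def shuffle_together(samples, labels, rng):
    """Shuffle samples and labels with the same permutation."""
    pairs = list(zip(samples, labels))
    rng.shuffle(pairs)
    xs, ys = zip(*pairs)
    return list(xs), list(ys)


def train_test_split(samples, labels, test_fraction=0.2):
    """Hold out the first `test_fraction` of the (already shuffled) data."""
    n_test = int(len(samples) * test_fraction)
    return samples[n_test:], samples[:n_test], labels[n_test:], labels[:n_test]


def fit_scaler(train_samples):
    """Compute per-feature mean and standard deviation on training data."""
    columns = list(zip(*train_samples))
    means = [statistics.fmean(c) for c in columns]
    # Fall back to 1.0 for constant features to avoid dividing by zero.
    stds = [statistics.pstdev(c) or 1.0 for c in columns]
    return means, stds


def apply_scaler(samples, means, stds):
    """Standardize each feature to zero mean and unit variance."""
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in samples]


rng = random.Random(0)

# Hypothetical toy data: 90 majority and 10 minority samples, sorted by
# class, with a second feature on a much larger scale than the first.
samples = [[rng.gauss(0, 1), rng.gauss(5000, 1000)] for _ in range(90)]
samples += [[rng.gauss(2, 1), rng.gauss(7000, 1000)] for _ in range(10)]
labels = [0] * 90 + [1] * 10

X, y = balance_classes(samples, labels, rng)      # balanced classes
X, y = shuffle_together(X, y, rng)                # randomization
X_train, X_test, y_train, y_test = train_test_split(X, y, 0.2)
means, stds = fit_scaler(X_train)                 # normalization
X_train = apply_scaler(X_train, means, stds)
X_test = apply_scaler(X_test, means, stds)
```

Fitting the scaler on the training set and reusing its statistics for the test set keeps information about the held-out data from leaking into training, so the test score remains an honest estimate of generalization. Undersampling is only one way to balance classes; oversampling or class weighting are alternatives when discarding data is too costly.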